Understanding the Concentration Dependence of Viral Capsid Assembly Kinetics - the Origin of the 

Lag Time and Identifying the Critical Nucleus Size 
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The kinetics for the assembly of viral proteins into a population of capsids can be measured in vitro with size 
exclusion chromatography or dynamic light scattering, but extracting mechanistic information from these studies 
is challenging. For example, it is not straightforward to determine the critical nucleus size or the elongation time 
(the time required for a nucleated partial capsid to grow completion). We show that, for two theoretical models of 
capsid assembly, the critical nucleus size can be determined from the concentration dependence of the assembly 
reaction half-life and the elongation time is revealed by the length of the lag phase. Furthermore, we find that 
the system becomes kinetically trapped when nucleation becomes fast compared to elongation. Implications of 
this constraint for determining elongation mechanisms from experimental assembly data are discussed. 



I. INTRODUCTION 

The assembly of protein building blocks into a shell or 'cap- 
sid' is essential for viral replication and thus understanding 
the mechanisms by which assembly proceeds could identify 
targets or opportunities for novel antiviral therapies. How- 
ever, despite extraordinary progress in determining the struc- 
tures of assembled capsids, assembly mechanisms for most 
viruses remain poorly understood because the structures of 
transient assembly intermediates have been inaccessible ex- 
perimentally. The kinetics for spontaneous capsid assembly 
in vitro have been measured with size exclusion chromatogra- 
phy (SEC) and dynamic light scattering (e.g. lli|2l|3l|4l|5llal), 
but extracting mechanistic information such as the critical nu- 
cleus size or the time to assemble an individual capsid has 
been challenging. In this article, we theoretically examine two 
models for capsid assembly kinetics to show that these prop- 
erties can be determined from the concentration dependence 
of median assembly times and the lag phase. 

Assembly kinetics in vitro have been measured for a num- 
ber of icosahedral viruses (e.g. OiQIllBlllI^) and demon- 
strate sigmoidal growth characterized by a lag phase, rapid 
growth, and finally saturation (see Fig. [T]). Zlotnick and 
coworkers 10, 0, S showed that partial capsid intermediates 
assemble during the lag phase, but it has often been assumed 
that the duration of the lag phase corresponds to the time re- 
quired for the concentration of critical nuclei to reach steady 
state, in analogy to models of actin nucleation. However, in 
this work we show that because light scatter signal measures 
the mass-averaged molecular weight of assemblages |4] and 
SEC usually monitors complete capsids, the length of the lag 
phase is related to the elongation time, or the time required for 
a nucleated partial capsid to grow to completion. Similarly, 
the critical nucleus size cannot be reliably determined from 
the concentration dependence of initial or maximum growth 
rates |0], and a method to do so using the extent of assembly 
|@] is data-intensive. We demonstrate that the critical nucleus 
size can be identified in a straightforward manner from the 
concentration dependence of the median assembly time. Fi- 
nally, we show that the system becomes kinetically trapped 
when the elongation time becomes long compared to the time 
scale for nucleation. 



In a contemporary article related to the current study, Mo- 
rozov and coworkers |9] consider simplified capsid assembly 
models in which nucleation occurs via a single dimerization 
event, which enables a elegant analytic solution. They show 
that the early phase of assembly can be characterized as a 
shock front and that for some conditions prohibitively long 
time scales are required to reach equilibrium. In this work we 
consider nucleation as an explicit multistep subunit addition 
process, with the objectives of understanding the concentra- 
tion dependence of overall assembly times and learning how 
nucleation and capsid growth times can be inferred from ex- 
perimental light scatter measurements. 



II. MODELS 

Zlotnick and coworkers |0, @] have developed a system of 
rate equations that describe the time evolution of concentra- 
tions of empty capsid intermediates 
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where N is the number of subunits in a complete capsid, Ci 
is the concentration of intermediates with i subunits, fi is the 
subunit association rate constant for intermediate i, which is 
related to the dissociation rate constant by detailed balance 
bi — f exp{Agi/kBT)/vo, with Agi the change in free en- 
ergy due to association of the subunit, and vq is the standard 
state volume. Following Ref. [8], transitions between inter- 
mediates are only allowed through binding or unbinding of a 
single subunit and there is only one intermediate for each size 
i. 

We consider two models for intermediate free energies and 
association rates. In the simple nucleation and growth model 
(henceforth referred to as 'NG')|'3l, the association rate con- 
stant / is independent of intermediate size and association free 
energies are given by Agi = g^uc before nucleation (i < rinuc) 
and Agi ~ .geiong during elongation (rinuc ^ i < N ~ 1), 
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where nnuc is the critical nucleus size. Since insertion of 
the final subunit generates the most new contacts, we set 
Agijv-i = 2(7eiong; our conclusions would be the same for an 
irreversible final assembly step. Although simple, this model 
has been shown to reproduce experimental assembly kinetics 
for several viruses 12i|5|]. Nucleation is usually assumed to 
correspond to completion of a polygon (e.g. a pentamer of 
dimers for CCMV |3] or a trimer of dimers for turnip crinkle 
virus llOtl) and could include intertwining of flexible terminal 
arms as well as subunit conformation changes. In this work 
we use belong = 2gnuc- In Zlotnick and coworkers' formula- 
tion, the nucleation and elongation phases are distinguished 
by having different forward rate constants rather than dif- 
ferent association free energies, but the current presentation 
yields similar behavior and seems easier to justify physically. 
For simplicity, we neglect geometrical statistical factors (si in 
Ref. §]). 

We also consider a model for capsid assembly based on 
classical nucleation theory (henceforth referred to as "CNT") 
suggested by the Zandi and coworkers Hill , in which the un- 
satisfied subunit-subunit interactions in a partial capsid inter- 
mediate are represented by a line tension a, and the binding 
free energy is 



Gi = igc + crh 



(2) 



with the binding free energy per subunit in a complete 
capsid, k = 2{-K/NY/'^[i{N - i)]^/"^ as the number of sub- 
units on the perimeter of the partial capsid and a = —gc/2 
ifTHl . The time evolution of intermediate concentrations is 
solved using Eq. [1] with the rate constants related by bi = 
fi exp[(G'i — Gi-i)/ksT]/vo, and association rate constants 
are proportional to the perimeter length, fi = fU. This model 
seems more realistic than the NG model for later stages of 
capsid assembly, but additional complexity may be required 
to properly describe nucleation or formation of the first poly- 
gon. 

We note that both models neglect the possibility of capsids 
with malformed or non-native structures, which are found for 
sor ne p arameter sets in dynamical simulations |[l2 , 1 3 .[Tsl 
[MoSIH, and association of intermediatesllJMHlf. 
We integrate the system of differential equations (Eq. [1]) nu- 
merically, with the initial condition ci = cq, where cq is the 
total subunit concentration. 



m. ESTIMATING TIME SCALES FOR NUCLEATION 
AND ELONGATION 

It is useful to write the overall capsid assembly time t as 
T — Tnuc + belong with Tnuc and Tgiong the average times for 
nucleation and growth, respectively. If we assume the con- 
centration of free subunits is constant during the assembly of 
a given capsid, the average time for a capsid to complete the 
elongation phase in the NG model can be calculated from the 
mean first passage time for a biased random walk with a re- 
flecting boundary conditions at rinuc and absorbing boundary 
conditions at N, with forward and reverse hopping rates given 



by /ci and 6eiong = / exp(5eiong)/uo, respectively l|22,|23|] 
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In the limit of fci 3> 6eiong Eq. |3]can be approximated to give 
belong ~ {N — n„uc)/ fci, while similar forward and reverse 
reaction rates, fci w 6eiong, gives ieiong « (iV - nnuc)^/2/ci. 
We will see that when elongation is fast compared to nucle- 
ation, the duration of the lag phase in capsid completion mea- 
surements is given by ri^g = Tgiong calculated from Eq.[3]with 
ci = Co. 

Under conditions of constant free subunit concentration, 
we could derive the average nucleation time with an equation 
analogous to Eq.[3] |8i|23l1 
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(4) 



Because free subunits are depleted by capsid nucleation and 
growth during spontaneous capsid assembly, however, the 
nucleation rate never reaches this value and net nucleation 
asymptotically approaches zero as the concentration of com- 
pleted capsids approaches its equilibrium value. Instead, treat- 
ing the system as a two-state reaction with nnuc-th order kinet- 
ics yields an approximation for the median assembly time ri/2, 
the time at which the reaction is 50% complete 



T\I2 = 



1 peq 



Nf 



exp (Gfi/ZcBT) Cq' 



(5) 



with n = rinuc — 1, — [n ~ l).gnuc for the NG model or 
given by Eq. |2]for the CNT model, and as the equilib- 
rium fraction of subunits in complete capsids, which can be 
calculated from the law of mass action flSi 12411 . For condi- 
tions under which most assembled subunits are in complete 
capsids, the equilibrium completion fraction is given by [25| 
PnV[^(1 - ^'n'')"] = exp(-GAr/A:Br) with Gat as the total 
binding free energy of a complete capsid. The factor of N^"^ 
in Eq.|5] accounts for the fact that N subunits are depleted by 
each assembled capsid. 

When capsid growth times are negligible compared to nu- 
cleation times, the expressions Eq. [3] and Eq. |5] respectively 
predict the duration of the lag phase and the overall median 
assembly time. However, as first noted by Zlotnick [7] the 
reaction becomes kinetically trapped if free subunits are de- 
pleted before most capsids finish assembling. It was recently 
suggested 19. 12311 that this trap occurs at parameters Ag and cq 
for which the rate of subunit depletion by nucleation {N/ r™") 
is equal to the elongation rate. We find that Eqs.|3]and |5]begin 
to fail a crossover concentration Cc for which initial nucleation 
and elongation rates are equal, but the system becomes kineti- 
cally trapped at a larger concentration Ckt defined by the point 
at which the median assembly time ti/2 matches the elonga- 
tion time. These concentrations are related to binding free 
energies and other parameters by 
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nuc / 
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for Co = Cc 
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FIG. 1: The time dependence of capsid assembly varies witli ini- 
tial subunit concentration cq. (a) The completion fraction Pm as a 
function of time for indicated initial subunit concentrations, (b) The 
calculated light scatter closely tracks completion fraction until ki- 
netic trapping sets in. The calculated light scatter (dashed lines) and 
completion fraction (solid lines) are shown as a functions of time (on 
a logarithmic scale) for indicated initial subunit concentrations, with 
5„uc = -7fcBT, iV = 120 and / = 10^ M-^s'V 

with T™" and ti/2 respectively given by Eq.|4]and Eq.|5] 



IV. NUMERICAL RESULTS 

We explore the concentration dependence of assembly ki- 
netics by numerically integrating the system of rate equations 
(Eq. [Til over a range of initial subunit concentrations cq. We 
start with the NG model, with representative parameters: the 
contact free energy ^nuc — Tk^T (w 4 kcal/mol) ll26ll . capsid 
size = 120 corresponding to 120 dimer subunits in hep- 
atitis B virus [26], the critical nucleus size rinuc = 5, and the 
subunit association rate constant / = 10^ M^^s^^ JSl]. 

The fraction of subunits in complete capsids Pn, which can 
be monitored by SEC, is shown as a function of time for sev- 
eral initial subunit concentrations cq in Fig. [T] In all cases 
there is a lag time while intermediates assemble, followed by 
rapid appearance of complete capsids and then eventually sat- 
uration of growth. The rate of capsid formation is nonmono- 
tonic with respect to initial subunit concentration; as antici- 
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FIG. 2: The end of the lag phase is measured by making a linear fit to 
the assembly kinetics trace at the point of maximal growth rate (■). 
The lag time (+) then corresponds to time at which the fit (dashed 
line) crosses the baseline. Plots are shown for co = 20/iM. 



pated by Eq.|6]a kinetic trap occurs for subunit concentrations 
larger than c^t ~ 60/iM in which the growth of nucleated par- 
tial capsids is stymied because the system rapidly becomes 
starved for free subunits 10, Si ■ 

We calculate light scatter signal /ls from the mass-averaged 
molecular weight of assemblages |2]. Since light scatter units 
are arbitrary, we normalize calculated light scatter by N to 
give 1 if all subunits are in complete capsids (P^ = 1). As 
shown in Fig. [TJ), the calculated light scatter closely tracks 
the completion fraction for initial subunit concentrations be- 
low Cc ~ 38/iM, since the majority of assembled subunits 
are found in complete capsids once the lag phase is complete. 
This correspondence completely breaks down by Ckt, when 
there are significant concentrations of partial-capsid interme- 
diates. 

The concentration-dependence of median assembly 
times and lag times. The median assembly times (reaction 
half-lives), and lag times numerically calculated for the nucle- 
ation and growth model with respect to the completion frac- 
tion Pn and calculated light scatter are given in Fig. |3] As 
illustrated in Fig. |2] lag times are extracted from numerical 
solutions for each parameter set with a linear fit to the assem- 
bly trace at the point of maximal assembly rate {dP^ / dt or 
diLs/dt). The lag time is given by the intersection of the 
linear fit with the baseline value, which is for the com- 
pletion fraction and roughly 1/A^ for calculated light scatter. 
Note that for initial subunit concentrations below lag times 
for both the completion fraction and calculated light scatter 
are inversely proportional to cq, and the mean first passage 
time estimate (Eq. O for the capsid elongation phase Teiong 
closely predicts the lag time for completion fractions. The lag 
time for light scatter is shorter than for the completion frac- 
tion because signal is integrated over all of assemblages, but 
demonstrates the same scaling. Hence, for this model, lag 
times measured for Pn (SEC) or light scattering are associ- 
ated with the elongation phase, or the time required to build a 
complete capsid. This correspondence begins to break down 
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FIG. 3: The median assembly times ri/2 (a) and the lag times ring 
(b) calculated from the completion fraction (Pn) and calculated light 
scatter (Jls) are shown as functions of initial subunit concentration 
cq. The estimates for the nucleation time (Eq.[5]with n„„c = 5) and 
lag time Eq. [3]are shown as dashed lines, and the estimates for the 
crossover concentration Cc and kinetic trap concentration Ckt ( Eq.|6} 
are shown as symbols on the estimated nucleation curve. 



at the crossover concentration Cc (Eq.|6]l, when the concentra- 
tion of free subunits is depleted before the first capsids finish 
assembling. Above this point, the molecular weight average 
growth rate becomes dominated by dimerization and hence 
varies inversely with the square of initial subunit concentra- 
tion. The relationship between the lag time and the elongation 
phase of capsid assembly discovered here explains the obser- 
vation of Endres and Zlotnick I's^ that the duration of lag time 
is proportional to the elongation forward rate constant. 

The reaction half-life (median assembly time) T1/2 is given 
by Pn('''i/2) = O.SPj^'' for the completion fraction, or the anal- 
ogous relation for calculated light scatter. Below q half-lives 
measured with light scatter and completion fraction agree 
quantitatively, as anticipated from Fig. [TJ), and agree closely 
with the two-state nucleation kinetics estimate (Eq. |5]). In 
Fig. [3^ the numerical and theoretical half-lives are normal- 
ized by the equilibrium P^^ to emphasize that the scaling with 
concentration identifies the critical nucleus size: Tia/P^'* cx 
^nnuc-i rpjjg f^^j. j-jj^j. assembly times can be predicted from 
nucleation kinetics alone can be understood by noting that the 



FIG. 4: The maximum assembly rates calculated from the comple- 
tion fraction (Pn) and calculated light scatter (Jls) are shown as func- 
tions of initial subunit concentration cq. The theoretical estimate for 
the maximum nucleation rate (Eq. |4] with n„uc = 5) is shown as a 
dashed line. 



elongation time is negligible compared to the overall assembly 
time below Cc. The close correspondence between the theoret- 
ical and numerical median assembly times below Cc suggests 
that this quantity may provide a simple alternative to the crit- 
ical nucleus estimator presented in the appendix of Ref. |8]. 

As anticipated, the kinetic trap point c^t roughly corre- 
sponds to the point at which the time to build the capsid be- 
comes longer than the average nucleation time (Eq. |6|. As- 
sembly eventually occurs above c^t as large partial capsids 
scavenge subunits from smaller intermediates. However, we 
caution that alternative capsid morphologies |5] and/or mal- 
formed capsids iTsl [Till , which are not considered in these 
models, may occur at binding free energies and subunit con- 
centrations above Ckt. 

As evident from Eq. |5] the critical nucleus size can also 
be identified by evaluating lnTi/2/PN as a function of the 
subunit-subunit association free energy gnuc- We find that 
agreement between the theoretical predictions and numerical 
results is insensitive to gnuc and iinuc- 

Maximum assembly rates. As noted by Endres and Zlot- 
nick Igf], it is not possible to relate the critical nucleus size to 
initial assembly rates due to the presence of the lag phase. At 
low concentrations, though, the maximum assembly rates ap- 
proach the initial nucleation rate given in Eq. ID as shown in 
Fig- SI and thus nearly scale with Cg°"^^. However, because 
the maximum assembly rates deviate from the theoretical pre- 
diction well below the crossover concentration q, it appears 
that median assembly times are a more robust predictor of the 
critical nucleus size. 

The classical nucleation model. To evaluate if our conclu- 
sions are model dependent, we also consider the concentration 
dependence of assembly times and lag times for the classical 
nucleation model (CNT, Eq. |2]i. To facilitate comparison of 
the two models, we set gc = ISSksT and / = 10^/ X)i!=i 
so that P^'^ and the average association rate constant are the 
same for both models. As for the NG model, the calcu- 
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FIG. 5: The median assembly time (■) and lag time (+) calculated 
from the completion time and the lag time for light scatter (A) for the 
classical nucleation model are shown as functions of concentration. 
The theoretical prediction for lag time Eq. [5] is shown as a dashed 
line. 



lated light scatter closely tracks the completion fraction be- 
low the kinetic trap point, and as shown in Fig. |5] the lag 
phase for both quantities lies close to the random walk esti- 
mate for the elongation time (Eq. O. The most significant 
difference between the two models is that the critical nu- 
cleus size in the classical nucleation model varies with sub- 
unit concentration HI: Unuc = 0.5iV(l - T{T'^ + ly/^ with 
r = [(7c — ln(cii;o)]/cr with vq the standard state volume, 
which gives 9 < nnuc ^ 4 for the simulated range of initial 
subunit concentrations. The reaction half life, however, ap- 
pears to scale roughly as ti/2/Pn oc Cq ^ because the effective 
critical nucleus size increases over the course of the reaction 
as free subunits are depleted. 

For all parameter sets we considered, the CNT model 
demonstrates a large effective critical nucleus size as the con- 
centration is reduced below Cc. Hence, analyzing the concen- 
tration dependence of experimental median assembly times 
could be one way to evaluate which of the CNT model or NG 
model better represents capsid assembly mechanisms. 

The slow approach to equiUbrium. We note that accu- 
rate estimation of P^^ is not necessary to obtain the critical 
nucleus size from ti/2, since nucleation rates have such a dra- 
matic concentration dependence even at subunit concentra- 
tions for which P^'^ > 0.5. This observation could be impor- 
tant, since the median assembly times shown in Figs.[3h and|5] 
extend beyond experimental feasibility at low concentrations 
(see Fig. |6l), particularly for the CNT model because of the 
large effective critical nucleus size. Furthermore, as noted by 
Zlotnick 101, capsid assembly approaches equilibrium asymp- 
totically. Therefore, when estimating P^^ it is important to 
plot PN{t) on a logarithmic scale to judge equilibration, and it 
may be necessary to extract the contact free energy gnuc or Oc 
from fits to kinetic data over a series of concentrations 
The slow approach to equilibrium due to nucleation barriers 
was noted from simulations in Ref. Il3j] and is discussed for 
the classical nucleation model in Ref. J^]. 




FIG. 6: Completion fractions at indicated times are shown as func- 
tions of initial subunit concentration cq for (a) the nucleation and 
growth model and (b) the classical nucleation model. In (a) and (b), 
the kinetic trap points Ckt corresponds to the maximum in Pn with 
respect to co at t = 2000. 



V. DISCUSSION AND OUTLOOK 

In this work we examine two theoretical models for capsid 
assembly, for which we find that the duration of the lag phase 
measured by SEC or dynamic light scatter is related to the 
time for a nucleated capsid to grow to completion, and hence 
scales inversely with initial subunit concentration. When nu- 
cleation of new partial capsids is faster than this growth time, 
the system becomes kinetically trapped due to starvation of 
free subunits, meaning that capsid formation rates decreases 
with increasing initial subunit concentration. If there is a well- 
defined critical nucleus size, it can be identified from the scal- 
ing of the median assembly time with respect to initial subunit 
concentration. Although highly simplified, the nucleation and 
growth model we consider here has been shown to closely 
resemble experimental capsid assembly data 10, Ht] and our 
findings therefore suggest new approaches to analyzing and 
interpreting experimental data to characterize assembly mech- 
anisms. 

These predictions have important implications for obtain- 
ing mechanistic information about capsid elongation (growth 
after nucleation) from bulk in vitro assembly kinetics exper- 
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iments. The fact that overall reaction times are closely pre- 
dicted by an expression based solely on nucleation kinetics ( 
Eq. |5] Fig. [3]) when assembly is most efficient, suggests that 
the lag phase contains the most information about elongation. 
In fact, the preceding analysis demonstrates that the average 
elongation time and hence the average growth velocity dur- 
ing elongation are directly related to the duration of the lag 
phase. Additional mechanistic information about the elonga- 
tion process could be obtained if the distribution of growth 
times could be deconvolved from the distribution of nucle- 
ation times. The kinetic trap criterion, however, limits this 
possibility by constraining growth times and hence the lag 
phase duration to be short compared to nucleation times. 

This constraint could potentially be overcome in several 
ways. The distribution of elongation times can be directly 
measured in experiments that monitor the assembly of individ- 
ual capsids, since the elongation and nucleation phases can be 
separated |27]. For bulk assembly studies, recent theoretical 
studies [23, 28] found that robust assembly is possible under 
conditions of fast heterogeneous nucleation if there is excess 



capsid protein. Thus, experimental systems in which capsid 
assembly is induced by nucleic acids [29], synthetic polymers 
1 30, 31 1, nanoparticles |32l], and portal or scaffolding proteins 
1 33 , 34 , [m could be used to elucidate elongation mechanisms 



(although assembly mechanisms can be influenced by the het- 
erogeneous component ll23il29il36ll ). 

Finally, we note that processes not considered in this work, 
such as transitions between assembly active and assembly in- 
active conformations of free subunits |j^ or hierarchical as- 
sembly It37i1 would add additional complexity to analysis of 
the lag phase. A systematic comparison of model predictions 
with experimental assembly data over a wide range of con- 
centrations could reveal additional features of complexity in 
assembly mechanisms and suggest model improvements. 
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